A Nonparametric Regression Approach to Control for Population Stratification in Rare Variant Association Studies
نویسندگان
چکیده
Recently, there is increasing interest to detect associations between rare variants and complex traits. Rare variant association studies usually need large sample sizes due to the rarity of the variants, and large sample sizes typically require combining information from different geographic locations within and across countries. Although several statistical methods have been developed to control for population stratification in common variant association studies, these methods are not necessarily controlling for population stratification in rare variant association studies. Thus, new statistical methods that can control for population stratification in rare variant association studies are needed. In this article, we propose a principal component based nonparametric regression (PC-nonp) approach to control for population stratification in rare variant association studies. Our simulations show that the proposed PC-nonp can control for population stratification well in all scenarios, while existing methods cannot control for population stratification at least in some scenarios. Simulations also show that PC-nonp's robustness to population stratification will not reduce power. Furthermore, we illustrate our proposed method by using whole genome sequencing data from genetic analysis workshop 18 (GAW18).
منابع مشابه
Assessing the impact of population stratification on association studies of rare variation.
AIMS The study of rare variants, which can potentially explain a great proportion of heritability, has emerged as an important topic in human gene mapping of complex diseases. Although several statistical methods have been developed to increase the power to detect disease-related rare variants, none of these methods address an important issue that often arises in genetic studies: false positive...
متن کاملLeveraging population information in family-based rare variant association analyses of quantitative traits.
Confounding due to population substructure is always a concern in genetic association studies. Although methods have been proposed to adjust for population stratification in the context of common variation, it is unclear how well these approaches will work when interrogating rare variation. Family-based association tests can be constructed that are robust to population stratification. For examp...
متن کاملFine-Scale Patterns of Population Stratification Confound Rare Variant Association Tests
Advances in next-generation sequencing technology have enabled systematic exploration of the contribution of rare variation to Mendelian and complex diseases. Although it is well known that population stratification can generate spurious associations with common alleles, its impact on rare variant association methods remains poorly understood. Here, we performed exhaustive coalescent simulation...
متن کاملRare and Low Frequency Variant Stratification in the UK Population: Description and Impact on Association Tests
Although variations in allele frequencies at common SNPs have been extensively studied in different populations, little is known about the stratification of rare variants and its impact on association tests. In this paper, we used Affymetrix 500K genotype data from the WTCCC to investigate if variants in three different frequency categories (below 1%, between 1 and 5%, above 5%) show different ...
متن کاملThe Soluble Carrier 30 A8 (SLC30A8) Gene Polymorphism and Risk of Diabetes Mellitus Type 2 in Eastern Azerbijan Population of Iran
Type 2 Diabetes Mellitus (T2D) is the most common metabolic disease demonstrating itself by hyper- glycemia, due to impaired insulin secretion or action. Recently, Whole-Genome Association studies have revealed the role of several new genes responsible for T2D. One of the most studied genes is SLC30A8 (Zn-T8) which is exclusively expressed in pancreatic ?-cells and participates in insulin stora...
متن کامل